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REMARKS 

Claims 109-115, 117, 119-133, and 135-166 are currently pending. 
Claims 112-115, 117, and 128-130 are withdrawn. Claims 1-108, 116, 118, and 
134 are cancelled. Claims 109, 127, and 131 are currently amended. Claims 
143-166 are newly added. The new claims do not add new matter. Support for 
the claims may be found, in part, in claims 112-118, Figure 24, and paragraph 
[0056] of the published application. 

I. Summary of telephone conference 

Applicants appreciate the April 19 th phone conference with Examiner Bhat. 
During the phone conference, it was determined that one way of addressing the 
§103 rejections of record is to define R1 as an antibody, as Egholm et al. and 
Weston et al. do not disclose antibody epitope binding agents. Even though there 
are other differences between the cited art and the presently pending claims, to 
expedite prosecution, Applicants have so amended claim 109. 

Additionally, Examiner Bhat expressed concern over the free energy 
limitation of claim 109. Specifically, the Examiner asserted that the free energy 
limitation was (A) broad, and (B) does not identify a specific structural component 
for its calculation. Applicants disagree with both points, as detailed below. 

(a) the free energy limitation is not overly broad 

Claim 109 provides the range of acceptable free energy for the duplex 
comprising R3 and R7, i.e. about 5.5 kcal/mol to 8.0 kcal/mol. As stated in earlier 
Applicant responses, this free energy range is essential to maintaining the 
internal balance of the claimed biosensor, between a false-positive signal on the 
one hand, and lack of sensitivity on the other. Hence, this range is not broad or 
arbitrary, but rather, reflects the narrow range of free energy values across which 
the claimed biosensor will function. Stated another way, if R3 and R7 of a 
biosensor do not meet the free energy requirements of claim 109, the biosensor 
will not produce a detectable signal (i.e., the biosensor will not work). 
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To clarify this requirement, claim 109 has been amended to further reflect 
this internal balance. In particular, claim 109 states that "R3 and R7 are a pair of 
complementary nucleotide sequences having a free energy for association ... 
such that R3 and R7 only associate when R1 and R5 are bound to the target 
molecule ." This amendment explicitly incorporates the internal balance, 
discussed above, into the claim limitations. 

Additionally, claim 109 provides a researcher skilled in the art with the 
necessary parameters to calculate the free energy for R3 and R7. In the context 
of nucleic acid duplex formation, free energy depends on three parameters: 
temperature, salt concentration, and the nucleic acid sequence of the duplex. 
(See the § 1 .1 .32 declaration of Dr. Tomasz Heyduk: "For a given nucleic acid 
duplex, the free energy depends on three factors...") Claim 109 provides a skilled 
researcher with two of these three parameters, i.e. temperature and salt 
concentration. 1 As a result, a researcher would be able to determine, for a given 
sequence (i.e. the third parameter), if the sequence will have the necessary free 
energy within the applicable temperature and salt ranges of claim 109. 2 

The Office has expressed concern that variations in methods of 
determining free energy make it difficult to determine whether a given sequence 
would fall within R3 and R7 of claim 109. This concern is based, in part, on the 
Office's reliance on the PNAS paper by SantaLucia. Applicants assert that this 
reliance has misled the Examiner . Scientifically speaking, there is only one 
definitive way to determine free energy, and that is to actually perform the 
experiment. To circumvent performing this experiment, however, researchers 
have attempted to create a system for estimatinp free energy. These attempts 
have been summarized, in part, in the Santa Lucia PNAS paper, which is 
directed to the nearest neighbor method of estimating free energy. 

The nearest neighbor theory is based on the premise that the free energy 
of a given complementary base pair is contingent on the context of that base pair 



1 Heyduk declaration, at point 3a. 

2 Id., at point 3c. 
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in the surrounding nucleic acid sequence, i.e., the neighbors. 3 From this basic 
premise, different groups used different means to come up with a system for 
estimating free energy. Hence, the different sets of data summarized in Table 1 
of the SantaLucia PNAS paper do NOT represent different methods of 
experimentally calculating free energy, but rather, represent different 
estimations of nearest neighbor parameters. Stated another way, the 
ambiguity the Office was attributing to the METHOD of calculating free 
energy is instead properly attributed to ESTIMATIONS of nearest neighbor 
parameters . For instance, the abstract of the SantaLucia PNAS paper explains 
that the different values in Table 1 come from different methods of determining 
the estimations: 

The seven studies [represented in Table 1] used data from 
natural polymers, synthetic polymers, oligonucleotide 
dumbbells, and oligonucleotide duplexes to derive [nearest 
neighbor] parameters; used different methods of data analysis; 
used different salt concentrations; and presented the [nearest 
neighbor] thermodynamics in different formats. 

Hence, the different values in Table 1 are solely attributed to different 
methods of estimating free energy, as opposed to different methods of 
experimentally determining free energy. In fact, scientifically speaking, there is 
only one definitive way to determine free energy, and that is to actually perform 
the experiment. All other methods are designed to provide "estimations," not 
actual free energy values. As a result, Applicants assert on the record that the 
free energy for a given nucleotide sequence should be directly determined, via 
experiment, for the purposes of ascertaining whether the sequence falls within 
the range of claim 109. This is consistent with the state of the art, which is 
evidenced in part by the § 1 .132 declaration by Dr. Tomasz Heyduk submitted 
herewith. Furthermore, by making this statement on the record, Applicants are 
bound by the theory of prosecution history estoppel. Consequently, in any future 



3 Id., at point 4. 
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disputes over claim construction, Applicants will be held to experimentally 
determining free energy. 

(b) The free energy requirement is tied to a structural component of the 
biosensor. 

The Office has asserted that the free energy limitation of claim 109 does 
not identify a specific structural component. Applicants disagree. 

The free energy limitation of claim 109 is specifically tied to the 
composition of R3 and R7 and meaningfully limits the composition of R3 and R7. 
As stated above, if the free energy for R3 and R7 falls outside the free energy 
range of claim 109, the biosensor will not work. In essence, the free energy 
limitation on R3 and R7 goes to the heart of the invention. 

Furthermore, one of skill in the art, given the temperature and salt 
conditions highlighted in claim 109, would be able to determine whether a given 
sequence for R3 and R7 would possesses a free energy within the claimed 
range, by experimentally determining the free energy as described above. 

II. Declarations previous filed 

Applicants would like to clarify a statement made by the Office in regard to 
the Heyduk declaration filed on September 1 1 , 2009. Specifically, the Office 
states at page 3 of the Office Action mailed 12/18/2009 that: "it is not clear which 
of these components contribute or inhibit or control the unexpected signal 
production or the R3/R7 pair having free energy of association between about 
5.5 kcal/mole and 8.0 kcal/mole..." 

As stated in the response filed September 1 1 , 2009, on pages 18 and 19, 
the only difference between the two biosensors used in the declaration 
experiment was the sequence of R3/R7. Stated another way, the other 
parameters listed by the Office (i.e. antibodies, Tris buffer, EDTA, and BSA) were 
identical between the two sensors , and therefore, could not have 
contributed to the unexpected results . The experiment was designed in this 
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manner to illustrate the importance of the free energy range to the functioning of 
a biosensor of claim 109. The Baez sequence, which had a free energy outside 
of the claimed range, did not produce a functional biosensor, as illustrated by the 
diagram in the September 1 1 th declaration. 

III. Claim interpretation, 35 USC 112, sixth paragraph 

Applicants agree with the Office's interpretation of "detection means" in 
claims 109, 127, and 131. 

IV. § 103 rejection in view of Egholm et al. and Weston et al. 

Reconsideration is requested of the rejection of claims 109-110, 119-127, 
131-132, and 135-142 under 35 USC § 103(a) in view of Egholm et al. and 
Weston et al. 

Amended claim 109 is directed to a biosensor comprising two constructs 
(R1-R2-R3-R4 and R5-R6-R7-R8). Claim 109 requires that R1 and R5 are 
antibodies. The Office has previously acknowledged that neither Egholm nor 
Weston disclose a reporter where R1 or R5 are antibodies. As a result, 
Applicants request withdrawal of the rejection in light of Egholm and Weston. 

Specifically, Egholm et al. discloses a reporter that recognizes and detects 
nucleic acid targets. The Egholm reporter only uses nucleic acid as an epitope 
binding agent, because the Egholm reporter relies on hybridization between the 
nucleic acid epitope binding agent and the target to produce a signal. Nowhere 
does Egholm teach or suggest an antibody based epitope binding agent, as 
required by claim 109. 

Similarly, Weston et al. discloses a reporter that recognizes and detects 
only nucleic acid targets. The Weston reporter only uses nucleic acid as an 
epitope binding agent, because the Weston reporter relies on hybridization 
between the nucleic acid target and the nucleic acid epitope binding agent to 
produce a signal. Nowhere does Weston teach or suggest an antibody based 
epitope binding agent as required by claim 109. 
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Three criteria must be present to establish a prima facie case of 
obviousness. 4 First, the prior art reference must teach or suggest all the claim 
limitations. Second, there must be some suggestion or motivation in the 
knowledge generally available to one of ordinary skill in the art to modify the 
reference. Third, there must be a reasonable expectation of success. 5 Not one of 
these three criteria is satisfied by the combination of Egholm et al. and Weston et 
al. cited by the Office. 

(a) the cited art does not disclose antibody epitope binding agents 

As discussed above, neither Egholm nor Weston disclose R1 and R5 of 
the present claims. Specifically, neither Egholm nor Weston disclose a biosensor 
where R1 and R5 are antibodies. Hence, Egholm and Weston, whether taken 
individually or combined, fail to disclose each element of the pending 
independent claims. 

(b) no motivation or reasonable expectation of success 

As neither Egholm nor Weston disclose a reporter that comprises R1 and 
R5 of the present claims, there is no motivation to combine them to create a 
sensor where R1 and R5 are antibodies, nor is there any reasonable expectation 
of success in doing so. Both the Egholm and the Weston reporter recognize their 
target via nucleic acid hybridization, necessitating that the epitope binding agent 
in both references is a nucleic acid. Hence, there is no motivation in the cited art 
to combine Egholm and Weston to arrive at the biosensor of claim 109. 

(c) dependent claims 

Claims 1 1 0 and 1 1 9-1 26 depend from claim 1 09 and necessarily 
incorporate each limitation of claim 109. As a result, Egholm and Weston do not 



4 MPEP §2143 

5 Id. 
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render claims 1 1 0, and 1 1 9-1 26 obvious for the same reasons as cited above 
with respect to claim 109. 

Claims 132 and 135-142 depend from claim 131 and necessarily 
incorporate each limitation of claim 131. Specifically, claims 132 and 135-142 
each incorporate the limitation that R1 and R5 are antibodies. Egholm and 
Weston only disclose nucleic acid epitope binding agents. As a result, Egholm 
and Weston do not render claims 132 and 135-142 obvious for the same reasons 
as cited above with respect to claim 132. 

Consequently, Applicants respectfully request withdrawal of the rejection 
of claims 1 09-1 10,11 9-1 27, 1 31 -1 32, and 1 35-1 42 as obvious in light of Egholm 
and Weston. As this is the only remaining rejection for claims 110, 1 19-127, 132, 
and 135-142, Applicants believe these claims to be in condition for allowance. 

(d) new claims 

New claims 143-154 encompass a biosensor where R1 and R5 recognize 
a non-nucleic acid target molecule. As detailed above, Egholm et al. and Weston 
et al. only teach nucleic acid targets. As a result, Egholm et al. and Weston et al. 
do not render new claims 143 -154 obvious. 

New claims 155-166 encompass a biosensor where R1 is an antibody 
epitope binding agent and R5 is an epitope binding agent. As detailed above, 
Egholm et al. and Weston et al. only teach nucleic acid epitope binding agents. 
As a result, Egholm et al. and Weston et al. do not render new claims 155 -166 
obvious. 

V. §103 rejection in view of Egholm et al. and Weston et al. and Baez et al. 

Reconsideration is requested of the rejection of claims 1 09, 111, 116, 131, 
133, and 134 under 35 USC §1 03(a) in view of Egholm et al., Weston et al., and 
Baez et al. 

As stated above, Egholm et al. and Weston et al. do not disclose antibody 
epitope binding agents. To address this shortcoming, the Office has resorted to 
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combining Egholm et al. and Weston et al. with the Baez application. Resort to 
Baez, however, does not cure the defect in the Office's obviousness rejection. 
Specifically, as detailed below, Egholm et al. does not disclose an R3 that meets 
the free energy requirements of claim 109, and as acknowledged by the Office, 
Baez does not disclose an equivalent to the R2-R3 portion of claim 1 09. Hence, 
even in combination, Egholm et al., Weston et al, and Baez et al. do not disclose 
each limitation of claim 109. 

(a) there is no prima facie case of obviousness 

Three criteria must be present to establish a prima facie case of 
obviousness. 6 First, the prior art reference must teach or suggest all the claim 
limitations. Second, there must be some suggestion or motivation in the 
knowledge generally available to one of ordinary skill in the art to modify the 
reference. Third, there must be a reasonable expectation of success. 7 Not one of 
these three criteria is satisfied by the combination of Egholm et al., Weston et al., 
and Baez et al. as cited by the Office. 

/. the cited references DO NOT disclose all limitations of claims 109 and 131 

As summarized above, and detailed in the response filed September 1 1 , 
2009, the Baez application fails to disclose an equivalent to R3. 

While Egholm et al. discloses a reporter that can detect a nucleic acid 
sequence, Egholm et al. DOES NOT disclose R3 of claim 109 as asserted by the 
Office. Specifically, Egholm et al. DOES NOT disclose a complementary 
nucleotide sequence having a free energy for association over the entire length 
of the nucleotide sequence, from about 5.5 kcal/mole to 8.0 kcal/mole. Rather, 
Egholm describes a complementary nucleic acid sequence that comprises 
peptide nucleic acid (PNA) . The Office has mistakenly assumed that the 
sequence TGACTGAC disclosed in Figure 7 of the Egholm application is DNA. 



6 MPEP §2143 
7 Id. 
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IT IS NOT. Instead, the sequence is PNA, as detailed in Figure 1 1 , and Example 
2 (lines 1 0-1 1 ). As a result, the free energy estimated by the Office IS NOT 
CORRECT , because the free energy of a PNA/PNA duplex IS NOT THE SAME 
as the free energy of a DNA/DNA duplex. This is illustrated, in part, by the data in 
Table 1 of the attached Sen et al. reference. 

Table 1 of Sen et al. shows that a PNA/PNA duplex DOES NOT HAVE the 
same free energy as a DNA/DNA duplex with the same sequence. In particular, 
the PNA/PNA duplex has a free energy that is ~2x the free energy of the identical 
DNA/DNA duplex in the absence of an organic solvent. The Office has cited to 
SantaLucia et al. as support for the free energy value erroneously assigned to 
Egholm et al., but SantaLucia is ONLY APPLICABLE TO DNA, not PNA. As a 
result, the Office has not established that Egholm et al. discloses a 
complementary sequence with a free energy from about 5.5 kcal/mole to 8.0 
kcal/mole. 

Hence, Egholm et al. and Weston et al., whether taken by themselves, or 
in combination with Baez et al., do not disclose each limitation of claim 109. 

//. no motivation or reasonable expectation of success 

Not only do the references not disclose each limitation of claim 109, but 
the cited references provide no motivation to modify the references to arrive at 
the biosensor of claim 109. Specifically, none of the cited references disclose or 
suggest the importance of the free energy range restriction on R3. As a result, 
the references provide no motivation to modify any of the cited references to 
arrive at the required R2-R3 of claim 109. 

The Office has asserted that a skilled artisan would be motivated to 
"modify the epitope binding agent of Egholm et al. with the expected benefit of 
improving the ratio of analyte specific signal to analyte non-specific background 
signal thereby enhancing analyte detection at very low concentration as taught 
by Baez et al." This assertion, however, is incorrect for at least two reasons. 
First , even if Baez is used to modify the epitope binding agents of Egholm, the 
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Office still has not established a prima facie case of obviousness, because 
neither Egholm nor Baez disclose R2-R3. Second , the Office is quoting Baez out 
of context. Paragraph [0017], read in context with the previous paragraphs, does 
not call out the antibody epitope binding agent as a means of improving signal. 
Rather, it calls out the Baez method of signal amplification as a means to 
improve signal. This is apparent from reading paragraphs [0014] - [0017]: 

[N]on-specific binding-signal due to non-selective binding of 
reporter conjugates to walls of the reaction tubes or to solid- 
phase reagents used in the assays even in the absence of 
analyte, is a serious problem in immunoassays . ... Despite 
numerous attempts in this art ... non-specific binding has not 
been eliminated. 

Hence, the improved signal referenced in paragraph [0017] of Baez is not 
because of the use of antibodies, but due to the claim that the Baez reporter 
eliminates non-specific signal derived from reporter conjugates binding to the 
solid surfaces of the assay. As a result, there is no motivation to combine the 
references to arrive at the biosensor of claim 109. 

Furthermore, there is no reasonable expectation of combining the 
references to arrive at a biosensor of claim 109. Egholm et al. is directed solely 
to detecting nucleic acid targets. Generally speaking, antibodies do not recognize 
nucleic acid targets with the same affinity or specificity as a complementary 
sequence. As a result, a skilled artisan would not predict that the hybridization 
schemes illustrated and taught by Egholm would work if the nucleic acid epitope 
binding agent was replaced with an antibody of Baez. 

In conclusion, the Office has not set forth a prima facie case of 
obviousness because the cited references do not disclose each element of claim 
109, provide no motivation for modifying the cited references to arrive at claim 
109, and there is no reasonable expectation of success in modifying the 
references. Consequently, Applicants respectfully request withdrawal of the 
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rejection of claim 109 as obvious in view of the combination of Egholm et al., 
Weston et al., and Baez et al. 

(b) claims 110, 119-126, 131, 132, 135-142 

Claims 1 1 0 and 1 1 9-1 26 depend from claim 1 09 and necessarily 
incorporate each limitation of claim 109. As a result, Egholm, Weston, and Baez 
do not render claims 1 1 0, and 1 1 9-1 26 obvious for the same reasons as cited 
above with respect to claim 109. 

Claims 132 and 135-142 depend from claim 131 and necessarily 
incorporate each limitation of claim 131. Claim 131, and as a result claims 132 
and 135-142, each incorporate the limitations on R1, R2, R3, R4, R5, R6, R7, 
and R8 found in claim 109. As detailed above, the cited combination of art does 
not disclose R2-R3 of claim 109. As a result, Egholm, Weston, and Baez do not 
render claims 1 31 , 1 32 and 1 35-1 42 obvious for the same reasons as cited 
above with respect to claim 1 09. 

Consequently, Applicants respectfully request withdrawal of the rejection 
of claims 109-110, 119-127, 131-132, and 135-142 as obvious in light of the 
combination of Egholm et al., Weston et al. and Baez. 

(c) new claims 

New claims 143-154 encompass a biosensor where R1 and R5 recognize 
a non-nucleic acid target molecule. R2 and R3 of the new claims are identical to 
claim 109. As detailed above, the combination of Egholm et al., Weston et al., and 
Baez et al. do not teach R2-R3. As a result, the cited art does not render new 
claims 143 -154 obvious. 

New claims 155-166 encompass a biosensor where R1 is an antibody 
epitope binding agent and R5 is an epitope binding agent. As detailed above, 
Egholm et al. and Weston et al. only teach nucleic acid epitope binding agents, 
and the combination of Egholm et al., Weston et al., and Baez et al. do not teach 
R2-R3. As a result, the cited art does not render new claims 155 -166 obvious. 
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CONCLUSION 

In light of the foregoing, applicants request entry of the claim 
amendments, withdrawal of the claim rejections, and solicit an allowance of the 
claims. The Examiner is invited to contact the undersigned attorney should any 
issues remain unresolved. 



Respectfully submitted, 
Polsinelli Shughart PC 



Date: June 18, 2010 By: /Rebecca C. Rilev-Varqas/ 

Rebecca C. Riley-Vargas, Reg. No. 60,046 

1 10 South Fourth Street, Suite 1 100 

St. Louis, MO 63102 

Phone -314-889-8000 

Fax -314-231-1776 

Attorney 

RCR/sat 



453251.2 



26 



Biochemistry 1996, 35, 3555-3562 



3555 



Improved Nearest-Neighbor Parameters for Predicting DNA Duplex Stability 1 

John SantaLucia, Jr.,* Hatim T. Allawi, and P. Ananda Seneviratne 
Department of Chemistry, Wayne State University, Detroit, Michigan 48202 
Received August 14, 1995; Revised Manuscript Received December 1, 1995 s 



abstract: Thermodynamic data were determined from UV absorbance vs temperature profiles of 23 
oligonucleotides. These data were combined with data from the literature for 21 sequences to derive 
improved parameters for the 10 Watson-Crick nearest neighbors. The observed trend in nearest-neighbor 
stabilities at 37 °C is GC > CG > GG > GA as GT % CA > CT > AA > AT > TA (where only the 
top strand is shown for each nearest neighbor). This trend suggests that both sequence and base composition 
are important determinants of DNA duplex stability. On average, the improved parameters predict AG°37, 
A// 0 , AS°, and T M within 4%, 7%, 8%, and 2 °C, respectively. The parameters are optimized for the 
prediction of oligonucleotides dissolved in 1 M NaCi. 



Accurate prediction of DNA thermal denaturation is 
important for several molecular biological techniques includ- 
ing PCPJ (Saiki et al., 1988), sequencing by hybridization 
(Fodor et al„ 1993), antigene targeting (Freier, 1993), and 
Southern blotting (Southern, 1975). In these techniques, 
choice of a noiioptimal sequence or temperature can lead to 
amplification or detection of wrong sequences (Steger, 1994). 
Furthermore, knowledge of the sequence dependence of 
DNA melting is important for understanding the details of 
DNA replication, mutation, repair, and transcription (Men- 
delman et al., 1989; Petruska et al., 1988). 

One widely used mediod for predicting nucleic acid duplex 
stability, pioneered by Tinoco and co-workers (Borer et al., 
1974), uses a nearest-neighbor model for helix propagation. 
Several nearest-neighbor parameter sets for predicting DNA 
duplex stability are available in the literature (Gotoh & 
Tagashira, 1981; Ornsiein & Fresco, 1983; Vologodskii et 
al., 1984; Wartell & Benight, 1985; Breslauer et al., 1986; 
Aida, 198S; Otto, 1989; Quartin & Wetmur, 1989; Klump, 
1990; Delcourt & Blake, 1991; Doktycz et al„ 1992). The 
quantum mechanical studies (Ornstein & Fresco, 1983; Aida, 
1988; Otto, 1989) were performed in the gas phase and 
neglected solvent and counterion interactions and, thus, do 
not reflect the conditions typically found in vivo or in vitro. 
Data for the thermal denaturation of polymers are difficult 
to interpret properly since their transitions are typically not 
two-state (i.e., many unfolding intermediates are possible) 
and their melting temperatures are high. Thus, polymer 
melting is typically performed in solutions with low salt 
concentration and the thermodynamic results are extrapolated 
to the standard state temperature (25 or 37 °C) and higher 
salt concentration (Breslauer et al., 1986). In addition, 
polymer melting does not involve a bimolecular initiation 
event and is dependent on only eight invariants which are 
linear combinations of the 10 nearest neighbor parameters 
required to predict oligonucleotide thermodynamics (Gray 
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& Tinoco, 1970; Vologodskii et al„ 1984; Doktycz et al., 
1992). Thus, studies of polymer thermodynamics (Gotoh 
& Tagashira, 1981; Vologodskii et al., 1984; Wartell & 
Benight, 1985; Klump, 1990; Delcourt Si Blake, 1991) are 
most applicable for the prediction of polymer behavior but 
do not reliably predict oligonucleotide thermodynamics 
(Sugimoto et al., 1994). Thus, we decided to expand the 
DNA oligonucleotide thermodynamic database and derive 
new nearest-neighbor parameters in 1 M NaCl buffer. 

In this paper, thermodynamic measurements are reported 
for 26 oligonucleotides ranging in length from 4 to 16 base 
pairs. Thermodynamic data for 23 of these sequences are 
combined with data for 21 oligonucleotides from the 
literature to derive improved nearest-neighbor parameters. 
The parameters are able to predict the stabilities of DNA 
duplexes within the limits of the nearest-neighbor model. 

MATERIALS AND METHODS 

DNA Synthesis and Purification. Oligonucleotides were 
the gift of Hitachi Chemical Research and were synthesized 
on solid support using standard phosphoramidite chemistry 
(Brown & Brown, 1991). Oligomers were removed from 
solid support and base blocking groups were removed by 
treatment with concentrated ammonia at 50 °C overnight. 
Each sample was evaporated to dryness, dissolved in 250 
fih of water and purified on a Si500F thin-layer chroma- 
tography plate (Baker) by eluting for 5 h with «-propanol/ 
ammonia/water (55:35:10 by volume) (Chou et al., 1989). 
Bands were visualized with an ultraviolet lamp, and the least 
mobile band was cut out and eluted three times with 3 mL 
of distilled deionized water. The sample was then evaporated 
to dryness. Oligonucleotides were desalted and further 
purified with a Sep-pak C-18 cartridge (Waters). The DNA 
was eluted with 30% acetonitrile buffered with 10 mM 
ammonium bicarbonate at pH 7.0. The purity of oligo- 
nucleotides was checked by analytical C-S HPLC (Perceptive 
Biosystems) and was greater than 95%. 

Measurement of Melting Curves. Absorbance vs temper- 
ature profiles (melting curves) were measured with an Aviv 
14DS UV-vis spectrophotometer with a five cuvette ther- 
moelectric controller. Custom-manufactured microcuvettes 
(Hellma Cells) with 0.1, 0.2, 0.5, and 1.0 cm path lengths 
(60, 120, 300, and 600 /<L volumes, respectively) were used 
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so that melting curves could be measured with high sensitiv- 
ity over a 100-fold range in oligonucleotide concentration. 
Aluminum adapters were used to properly position micro- 
cuvettes in the light beam and provide optimal thermal 
contact with the thermoelectric cuvette holder. To prevent 
water condensation at low temperatures, the sample compart- 
ment was purged with dry nitrogen gas. 

The temperature was monitored with the temperature 
transducer (Analog Devices Inc.) mounted in the spindle of 
the Aviv thermoelectric cuvette holder. The temperature 
readings from the transducer were calibrated by measuring 
the voltage produced by a type K thermocouple inserted in 
a 1 cm microcuvette during a typical thermal denaturation 
run. We estimate the temperature measurement to be 
reproducible within 0.1 °C and accura tewjjhjn 03 °C. 

Oligonucleotides were dissolved in( LQ M NaCj7 }10 raM 
sodium cacodylate, and 0.5 mM Na^EDTA, pH^ buffer. 
Samples were "annealed" and degassed by raising the 
temperature to 85 °C for 5 min and then cooling to -1.6 °C 
over a period of 25 min just prior to a melting experiment. 
While at 85 °C, the absorbances were measured at 260 nm 
for later calculation of oligonucleotide concentration using 
extinction coefficients calculated from dinucleoside mono- 
phosphates and nucleotides, as described (Richards, 1975). 
Oligonucleotide concentration was varied over an 80-100- 
fold range. Samples were then heated at a constant rate of 
0.8 D C/min with data collection beginning at 0 °C and ending 
at 90-95 °C. Control experiments with a heating rate of 
0.4 °C/min gave same results as those obtained with 0.S "CI 
min indicating that thermal equilibrium was established. The 
duplex to coil tranistion was monitored by measuring the 
absorbance at 280 nm. Air was used for the reference light 
beam. The spectral bandwidth was 1 nm. Absorbances were 
not corrected for thermal expansion since the correction was 
linear and less than 3% from 0 to 90 °C. 

Determination of Thermodynamic Parameters. Thermo- 
dynamic parameters for duplex formation were obtained from 
melting curve data using the program MELTWIN v2.1 (Jeff 
McDowell and Douglas H. Turner, unpublished) as described 
(Petersheim & Turner, 1983). Data were truncated so that 
die upper and lower temperature baselines reflected the slopes 
in the transition region, generally using T u ± 30 °C 
(Petersheim & Turner, 1983). The root mean square 
difference between data and calculated curves is less than 
0.5%, the approximate error in the absorbance reading. The 
enthalpy and entropy for the random coil to duplex equilib- 
rium were obtained by two methods: (1) AH° and AS 0 from 
the fits of individual curves were averaged, and (2) plots of 
reciprocal melting temperature (jT m ~ ! ) versus the natural 
logarithm of the total strand concentration (In C T ) were fit 
to eq 1 (Borer et al., 1974): 

V 1 = R/AH° In C T + AS 0 / AH 0 ( I ) 

The T M is defined as the temperature at which half of the 
strands are in the double helical state and half are in die 
"random coil" state. For self-complementary oligonucleo- 
tides, the 7m for individual melting curves was calculated 
from the fitted parameters using 

r M = AH°/(AS° + R In C T ) (2) 

where R is the gas constant [1.987 cal/(K mol)], and the T M 



is given in K. For non-self-complementary molecules, CV 
in eqs 1 and 2 was replaced by Or/4. Both methods are 
essentially a van't Hoff analysis of the data, assuming the 
transition equilibrium involves only two states (i.e., duplex 
and random coil). We also assume that the difference in 
heat capacities (AC P ) of these states is zero (Petersheim & 
Turner, 1983; Freier et al, 1986b). These two methods 
depend differently on the two-state approximation. For a 
given oligonucleotide, agreement of parameters derived by 
the two methods is a necessary, but not sufficient, criterion 
to establish the validity of the two-state approximation 
(SantaLucia et al., 1990; Marky & Bresiauer, 1987). The 
methods described above have been shown to give thermo- 
dynamic results in good agreement with those obtained by 
calorimetry (Albergo et al., 1981). 

Choice of Sentiences. Sequences were designed or se- 
lected from the literature to meet the following criteria: (1) 
two-stale thermodynamics, (2) 7m between 20—65 D C to 
minimize extrapolation to 37 °C and allow the upper and 
lower temperature baselines to be adequately defined, (3) 
sequences with three or more consecutive guanine residues 
are not included since these sequences consistently yield 
lower than expected AH a values (Bresiauer et al., 1986), (4) 
sequences have terminal GC base pairs to minimize helix 
"fraying" that could invalidate the two-state approximation, 
and (5) the oligonucleotides were dissolved in 1 M NaCl so 
that length-dependent counterion condensation effects could 
be neglected (Record & Lohman, 1978; Olmsted et al., 1989). 
Sequences measured in this study were designed to meet the 
above criteria and provide uniform representation of the 10 
different nearest neighbors in the database. Throughout this 
paper nearest-neighbor base pairs are represented widi a slash 
separating strands in anliparallel orientation (e.g., AC/TG 
means 5'-AC-3' Watson-Crick base paired with 3'-TG-5'). 
The 10 nearest-neighbor sequences occur in this study with 
the following frequencies AA/TT = 43, ALT A = 21, TA/ 
AT =14, CA/GT = 28, GT/CA = 27, CA/GT = 23, GAJ 
CT = 36, CG/GC = 35, GC/CG = 33, and GG/CC = 29. 

Determination of Nearest-Neighbor Thermodynamic Pa- 
rameters. The total difference in the free energy of the 
folded and unfolded states of a DNA duplex can be 
approximated with a nearest-neighbor model: 

AG,(total) = X/ifjAG, + AG(init) + AG,(sym) (4) 

where each different oligonucleotide duplex is given the 
subscript /, AG; are the free energies for the 10 possible 
Watson— Crick nearest-neighbor stacking interactions [e.g., 
AGi = AG° 37 (AA/TT), AG 2 = AG° 37 (TA/AT), etc.], 
My is the number of occurrences of each nearest neighbor^", 
in each sequence, /, AG(init) is the initiation free energy, 
and AG,(sym) equals +0.4 kcai/mo! if duplex i is self- 
complementary and zero if it is non-self-complementary 
(Bailey & Monahan, 1978; Cantor & Schimmel, 1980). 

The thermodynamic results from 44 sequences with two- 
state transitions were used to construct appropriate matrices 
for input into the linear regression analysis. The [AG,(total) 
- AG/(sym)] formed the column matrix Gr„t and the 
standard errors in the AG^total), a it formed the column 
matrix a. The number of occurrences of each of the nearest 
neighbors, along with the initiation parameter formed the 
"stacking matrix", S with dimensions 44 x 11. The values 
of the 10 nearest neighbors and initiation, Gj, are unknown 
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Figure 1: Reciprocal melting temperature vs In C T plots for 
GTTGCAAC (•), GTACGTAC (a), GGACGTCC (0), and 
CGATATCG (■). 

and form the column matrix, Gnn. The data for all sequences 
is thus written: 

G Tu , - SG NN (5) 

The solution of eq 5 for the nearest neighbors, Gnn, was 
obtained using singular value decomposition (Press et al„ 
1989) which effectively minimizes the error weighted squares 
of the residuals (Bevington, 1969): 

X 3 = ^KAG, - SijAGjVcjf (6) 

Analogous calculations were performed to obtain nearest- 
neighbor parameters for AH° and AS 0 . All matrix manipu- 
lations were performed using the program MATHEMAT1CA 
(Wolfram, 1992). To verify our calculation methods, we 
derived the nearest-neighbor parameters for RNA and 
reproduced the literature values (Freier et al., 1986a). 

RESULTS 

Thermodynamic Data. All sequences in tills study dis- 
played monophasic melting transitions (data not shown) and 
showed concentration-dependent T M s, indicating complexes 
with molecularity greater than 1. Plots of 7" M _I versus In 
C T were linear (correlation coefficient >0.99) over the entire 
S0-100-foid range in concentration and are shown in Figure 
1 and Supporting Information (see paragraph at the end of 
the paper). Thermodynamic parameters derived from the 
average of fits of individual melting curves and from T M ~ l 
versus In C T are listed in Table 1. Those sequences in which 
the AH° from the two methods agree within 20% are listed 
in Table 1 as two-state transitions (SantaLucia et al., 1990; 
Marky & Breslauer, 1987). Those with differences in AH° 
greater than 20% are listed as non-two-state transitions. 
Experimental heat capacity differences (Table 1), AC P , were 
determined from the slope of AH 0 vs T M plots (data not 
shown), where the ArV°s and T M s are from the fitted curves 
of each oligonucleotide at different concentrations (Peter- 
sheim & Turner, 1983; Freier et al., 1986b). For two-state 
transitions, parameters derived from the average of the fits 
and from T M ~ ] vs In Or plots are equally reliable; thus the 
average of these parameters (Table 2) was used for the linear 
regression to determine nearest-neighbor increments (Table 
3). 

Nearest-Neighbor Parameters. Table 3 lists the nearest- 
neighbor thermodynamic parameters derived by multiple 
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linear regression from the data in Table 2. These parameters 
allow for self-consistent and accurate prediction of the 44 
sequences in Table 2 with two-state thermodynamics. 
Sequences with terminal T-A base pairs were not included 
in the regression analysis in order to minimize systematic 
errors due to terminal "fraying" in these sequences. We 
performed a control experiment where the eight sequences 
in Table 2 with terminal A-T base pairs were included in 
the regression analysis. The results showed that a poorer 
fit (as judged by the x 2 and Q parameters) was observed. 
This was particularly true for the AH 0 and AS° parameters. 
Inclusion of these sequences systematically made nearest 
neighbors with 3'-terminal T residues more stable than those 
with 5'-terminal T residues. For example, die AG° 37 s for 
AT/TA, TA/AT, GT/CA, TG/AC, CT/GA, and GT/CA 
nearest neighbors were -1.4S, -0.11, -1.76, -0.99, -1.53, 
and —1.10 kcal/mol, respectively, while the parameters for 
AA/TT, CG/GC, GC/CG, GG/CC, and initiation did not 
change compared to the values listed in Table 3. We find 
that the parameters in Table 3 can predict these sequences 
reasonably well if a penalty of +0.4 kcal/mol is assigned 
(for AG°37 and AH C ) for each terminal 5'-T'A-3' base pair. 
Note that sequences with terminal 5'-A-T-3' base pairs are 
not assigned this penalty. Apparently, sequences with 
terminal 5'-T-A-3' base pairs "fray" more than sequences 
with terminal 5'-A-T-3' base pairs. The parameter for 
terminal 5'-T-A-3' base pairs is included in Table 3. 

The Helix Initiation Parameter. The AG° 37 for helix 
initiation is +1.82 ± 0.24 kcal/mol (Table 3). This number 
applies to DNA duplexes with at least one G-C base pair 
and agrees reasonably well with the value of +2.3 kcal/mol 
determined previously (Pohl, 1974; Turner et al., 1990). 
Previous work indicated that initiation in sequences with only 
A-T base pairs is +3.4 kcal/mol (Scheffler et al., 1970; 
Turner et al., 1990). Thus, we have assumed the initiation 
at A-T pairs is +2.8 ± 1 kcal/mol (Table 3). This allows 
for the correct prediction of the AG° 37 for A 3 /T B (Table 2; 
Sugimoto et al„ 1991) and TTTTATAATAAA/AAAATAT- 
TATTT (Bolewska et al., 1984). 

The penalty for duplex initiation was assumed to be purely 
entropic for the reasons described previously (Freier et al., 
1986a). When the initiation enthalpy was allowed to vary 
in the regression analysis, a favorable value with a large error 
was observed (-7.2 ± 3.4 kcal/mol), and the enthalpy 
increments for the 10 nearest-neighbors are less favorable 
by 1.1 kcal/mol, on average. When the initiation entropy 
was allowed to vary in the regression analysis, a more 
unfavorable value was observed (-29.1 ± 12.4 eu), and the 
entropy increments for the 10 nearest-neighbors are more 
favorable by 3.5 eu, on average. 

Error Analysis. To evaluate the relative uncertainties in 
the nearest-neighbor parameters derived above, we deter- 
mined how the experimental errors, a,-, propagated to the 
errors in the nearest neighbors, a h as described [Press et al. 
(1989) eq 14.3.19]. In the regression analysis described 
above, the experimental data were weighted assuming 5%, 
10%, and 10% uncertainties in the AG° 37 , AH°, and AS°, 
respectively. The experimental uncertainties given in Table 
1 were not used because they reflect the experimental 
reproducibility of the data (i.e., precision) not the accuracy 
of the data (Bevington, 1969). Data from the literature were 
assigned the same percent errors. Assigning the same percent 
error for all the data effectively weights the data for all 
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Table 1- Thermod namic Parameters of 
_a_e_ lLrmo ynamic arame ers o 


Duplex Formation" 
















1/T M vs log C t 


parameicrs 






curve fit parameters 








—AW 


-AS 0 


Tu 


-AG 0 37 






—AC,, [kcal/ 


DNA duplex 


( -cal mol) 


(kcal/mol) 


(eu) 


(°C)" 


(kcal/mol) 


(keal/mol) 


(eu) 


(K mol)]' 






Two-State Transitions 












CCGG 


3.4 ± 0.3 


31.9 ± 1.1 


91.9 ±2.5 


16.7 


3.6 ±0.1 


29.4 ±1.0 


83.0 ±3.5 


0.2 


CGCO 


3.9 ± 0.4 


38.5 ± 1.6 


11 1.5 ±3.8 


23.6 


4.2 ± 0.2 


34.0 ± 1.7 


96.2 ± 6.0 


0.3 


GCGC 


4.3 ± 0.5 


45.6 ± 2.4 


133.3 ±6.1 


27.8 


4.5 ± 0-1 


35.8 ±0.9 


100.9 ±2.4 




CGATCG 


5.4 ± 0.4 


33.1 ± 1.1 


S9.3 ± 2.3 


34.3 


5.3 ± 0.2 


41.8 ±3.8 


117.9± 11.8 


0.5 


GACGTC 


5.6 ±1.3 


37.2 ±4.1 


101.8 ±9.2 


36.4 


5.5 ± 0.2 


46.6 ± 3.8 


132.6 ± 11.8 


0.5 


GCTAGC 


5.4 ± 0.6 


35.7 ± 1.8 


97.7 ± 3.9 


34.3 


5.3 ±0.1 


42.7 ± 3.9 


120.5 ± 12.2 




GGATCC 


5.1 ±0.2 


30.2 ± 0.6 


80.9 ± 1.3 


30.8 


4.9 ± 0.2 


39.4 ± 3.8 


11I.4± 11.8 




CAAGCTTG 


7.0 ± 0.2 


54.7 ± 0.7 


153.8 ± 1.8 


44.6 


7.0 ±0.1 


53.8 ±0.8 


150.9 ±2.5 




CATCGATG 


7.5 ± 0.6 


56.3 ± 2.3 


157.4 ±5.6 


47.4 


7.7 ±0.1 


62.0 ± 2.4 


175.1 ±7.4 




CGATATCG 


6.8 ± 0.2 


51.8 ±0.6 


145.1 ± 1.4 


44.0 


6.9 ±0.1 


55.7 ± 3.9 


157.1 ± 12.1 


0.5 


GAAGCTTC 


6.9 ± 0.2 


44.1 ± 0.8 


120.3 ± 1.8 


45.5 


7.1 ±0.2 


54.7 ±3.2 


153.5 ±9.8 


0.5 


GATCGATC 


7.5 ± 0.6 


P o± n 




47.7 


7.7 ± 0.3 


63.2 ± 5.9 


178.8 ± 17.9 


1.2 


GATGCATC 


7.2 ± 0.3 




144.2 ±2.5 


46.7 


7.4 ± 0.2 


60.3 ± 2.6 


170.4 ± 7.9 




GGAATTCC 


6.8 ± 0.2 


46.1 ± 0.5 


127.0 ± 1.2 


44.5 


6.8 ± 0.3 


57.0 ± 2.7 


161.8 ±8.3 


0.4 


GGACGTCC 


8.9 ± 0.2 


58.6 ± 0.8 


160.2 ± 1.8 


55.2 


9.1 ±0.1 


61.4 ±0.8 


168.7 ± 2.4 




GGAGCTCC 


8.6 ± 0.3 


53.4 ± 0.9 


144.6 ±2.1 


54.7 


8.8 ± 0.2 


60.1 ± 1.8 


165.4 ±5.1 




GTACGTAC 




51.3 ±0.6 


142.7 ± 1.5 


45.1 


7.1 ±0.2 


56.4 ± 3.7 


159.0 ± 1 1.4 


0.7 


GTAGCTAC 


7 n + n ~\ 


51.4 ±0.6 


143.3 ± 1.3 


44.9 


7.0 ±0.1 






0.5 


GTTGCAAC 




47.8 ± 1.6 


130.5 ±3.8 












CCATCGCTACC/GGTAGCGATGG 


n.5± 0^3 


86.9 ± 1.2 


236.8 ± 3.0 


63.9 




82.8 ± 1.8 


224.5 ± 5.2 




CCATTGCTACC/GGTAACGATGG 


12.3 ±0.3 


83.9 ± 1.4 


231.1 ±3.4 


59.7 


12.1 ± 0.1 


SI.l ±0.6 


222.4 ± 1.7 




CTGACAAGTGTC/GACTGTTCACAG 


13.0 ±0.6 


88.6 ± 2.8 


243.7 ± 6.9 


61.6 




74.3 ± 2.2 


200.6 ± 6.8 




CAT ATGGCC ATATG 


13.4 ±0.5 


93.7 ± 2.2 


25S.7 ± 5.6 


65.0 


12.0 ±0.3 


75.4 ± 2.4 


204.3 ± 7.0 








Non-Two 


-Slate Transitions 










GTATACCGGTATAC 


13.3 ±0.1 


101.7 ±0.7 


285.3 ± 1.9 


61.9 


11.3 ±0.2 


74.3 ± 0.7 


203.0 ± 1.9 




CATATTGGCCAATATG 


14.9 ± 1.1 


i 10.2 ±5.8 


307.4 ± 15.1 


65.3 


1 1 .6 ± 0.3 


67.3 ± 3.0 


179.7 ± 8.9 




GTATAACCGGTTATAC 


15.3 ±0.3 


113.6 ±1.7 


3 16.8 ±4.3 


65.9 


13.2 ±0.3 


85.8 ± 1.6 


234.0 ± 4.5 





" Listed by oligomer length and in alphabetical order. For self-complementary sequences, only the top strand is given. For non-sclf-complementary 
duplexes, both strands are given in antiparallei orientation separated by a slash. Solutions are 1 M NaCI, 10 mM sodium cacodylate, and 0.5 mM 
NaiEDTA, pH 7. Errors are standard deviations from the regression analysis of the melting data. Extra significant figures are given for AW and 
&S° to allow accurate calculation of AG° 37 and 7" K . '' Calculated for 10~ J M oligomer concentration for self-complementary sequencs and 4 x 10" J 
M for non-sclf-complementary sequences. r Only those sequences with a AW vs 7" M plot with a correlation coefficient greater than 0.8 are listed. 
Errors in AC,, are approximately 50%. 



sequences equally in the regression analysis. Thus, the 
percent error assumed has no effect on the values of die 
nearest-neighbor parameters obtained, only on the propagated 
errors in the parameters. For example, if we assume errors 
of 5% for A/V°, we obtain the same nearest-neighbor 
parameters as with 10% errors, but the error estimates for 
the nearest neighbors are half as large. The nearest-neighbor 
errors, Oj, given in Table 3 are the standard deviations that 
resulted from the propagation of experimental errors, a u 
during linear regression. The free-energy covariances (Press 
et al., 1989) between pairs of nearest neighbors are small 
(less than ±0.002 kcal : /mo! 2 covariance) and can be ne- 
glected. The initiation parameter, however, covaries with 
all 10 of the nearest neighbors (—0.006 kcal 2 /mol 2 covari- 
ance, on average). 

Goodness of the fits for the nearest neighbor parameters 
for AG°3 7 , AH°, and AS 0 were evaluated from the values of 
y 2 (eq 6) and the probability Q (Press et al., 1989). Q is the 
probability that a x 2 larger than that observed could be 
obtained by chance (larger Q indicates a better fit). Q 
probabilities greater than 0.001 are considered statistically 
acceptable (Press et al., 19S9). Q was calculated from the 
incomplete gamma function, gamma [v/2, yV2], where v is 
the number of degrees at freedom (v = number of inde- 
pendent measurements minus the number of parameters 
derived from the data) (Press et al., 19S9). Since measure- 
ments were made on 44 different oligonucleotides and 11 
parameters (10 nearest neighbors plus initiation) were 
determined, v is 33 (for AG D 37 ) or 34 (for AH 0 and AS° 



which do not have the initiation parameter floating). For 
AG° 37 , X 2 = 28.4 and Q = 0.70. For AH°, x 2 = 33.8 and Q 
= 0.47. For AS", f- = 50.7 and Q = 0.03. These results 
suggest that within the estimated errors the nearest-neighbor 
model is a valid description of DNA thermodynamics in 
agreement with previous results (Sugimoto et a!., 1994; 
Doktycz et al., 1995). 

Comparison of Experimental vs Predicted Thermodynam- 
ics. Table 2 compares the experimental results for 60 
oligonucleotides with those predicted using the nearest- 
neighbor parameters in Table 3. The 44 sequences that have 
two-state transitions are well predicted by the parameters in 
Table 3. For the T M at 0.1 mM, the largest difference is 4.6 
°C with an average deviation of 1.8 °C. The ability of the 
parameters in Table 3 to predict the T M accurately is 
encouraging, since the parameters were not specifically 
optimized for the prediction of the 7 M . The average 
deviations between experiment and prediction for AG° 37 , 
AH°, and AS° are 4%, 7%, and 8%, respectively. Previous 
results for RNA (Kierzek et al., 1986), DNA (Sugimoto et 
al., 1994), and DNA/RNA hybrid (Sugimoto et al., 1995) 
oligonucleotides with different sequences, but the same 
nearest neighbors, suggest tire nearest-neighbor model should 
be able to predict AG°n, AH a , and 7* M (at 0.1 mM) with 
average deviations of roughly 6%, 8%, and 2 °C, respec- 
tively. Thus, the predictive capacity of the parameters in 
Table 3 is within the limits of the nearest-neighbor model. 
The x 2 and Q parameters, described above, also suggest that 
the nearest-neighbor parameters given in Table 3 adequately 
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Table 2: Experimental and Predicted Thermodynamic Parameters of Duplex Formation" 



experimental predicted 







-AC°37 


-AH° 


-AS 0 






n ii 31 » 


aTp — 


— 777" 


— — 


sequence 


1* 
rei 


(kca!/mol) 


(kcai/mol) 


(eu) 


( <-.) 


tkcaJ/moi) 


(Kcal/molj 


(eu) 


( cy 




Molecules 


with Two-State Thermodynamics 












CCGG 




3.5 


30.6 


S7.4 


16.6 


3.4 


23.5 


64.0 


12.4 


CGCG 




4.0 


36.3 


103.9 


23.7 


4.2 


31.3 


86.7 


25.0 


GCGC 




4.4 


40.7 


117.! 


27.5 


4.4 


32.3 


89.6 


26.2 


CCGCGG 


d 


8.0 


41.4 


107.S 


55.2 


7.8 


44.7 


117.9 


55.0 


CGATCG 




5.3 


37.5 


103.6 


34.3 


5.6 


42.1 


117.7 


36.4 


CGCGCG 




S.3 


46.4 


122.8 


55.7 


8.6 


52.5 


140.6 


57.3 


CGGCCG 


d 


8.3 


3 S.7 


98.0 


59.6 


7.8 


44.7 


117.9 


55.0 


CGTACG 


/ 


5.4 


45.7 


130.0 


35.0 


5.4 


43.7 


122.8 


36.6 


GACGTC 




5.6 


41.9 


117.2 


36.1 


5.7 


42.7 


119.4 


36.9 


GCATGC 


S 


5.6 


42.2 


118.0 


36.5 


5.8 


45.5 


121.5 


38.0 


GCCGGC 


d 


8.5 


45.2 


118.3 


57.7 


8.0 


45.7 


120.8 


55.4 


GCGAGC/CGCTCG 


h 


7.7 


51.4 


124.0 


33.2 


7.5 


40.7 


115.1 


32.0 


GCGCGC 


e 


9.1 


59.6 


162.7 


56.1 


S.S 


53.5 


143.5 


57.5 


GCTAGC 




5.3 


39.2 


109.1 


34.3 


5.3 


40.7 


114.8 


32.6 


GGATCC 




5.0 


34.8 


96.2 


30.S 


5.0 


35.3 


97.9 


30.6 


GGCGCC 


d 


7.9 


43.5 


114.7 


53.9 


8.0 


45.7 


120.8 


55.4 


GGGACC/CCCTGG 


h 


6.5 


32.7 


84.5 


44.9 


6.4 


36.4 


96.0 


45.3 


GTGAAC/CACTTG 


h 


5.1 


43.6 


124.0 


33.2 


4.9 


40.7 


115.1 


32.0 


CAAAAAG/GTTTTTC 




4.8 


47.0 


136.0 


31.5 


4.S 


47.1 


135.7 


32.7 


CAAAAAAG/GTTTTTTC 


j 


5,7 


59.0 


172.0 


36.9 


5.8 


55.5 


159.3 


39.4 


CAAGCTTG 




7.0 


54.2 


152.3 


44.6 


7.2 


54.9 


153.7 


46.0 


CATCGATG 




7.6 


59.2 


166.3 


47.4 


7.0 


53.3 


149.6 


44.3 


CGATATCG 




6,9 


53.7 


151.1 


44.1 


6.9 


54.9 


155.0 


43.6 


CGTCGACG 


k 


9.8 


64.1 


175.1 


5S.3 


9.S 


62.9 


170.4 


60.2 


GAAGCTTC 




7.0 


49.4 


13G.9 


45.2 


7.3 


55.5 


155.7 


45.8 


GATCGATC 




7.6 


58.4 


163.7 


47.6 


7.2 


53.9 


151.6 


44.1 


GATGCATC 




7.3 


56.1 


157.3 


46.5 


7.2 


54.3 


152.5 


44.8 


GGAATTCC 




6.8 


51.6 


144.4 


43.8 


7.0 


52.1 


145.1 


45.7 


GGACGTCC 




9.0 


60.0 


164.5 


55.1 


9.2 


56.1 


150.6 


59.0 


GGAGCTCC 




S.7 


56.8 


155.0 


54.4 


8.8 


52.1 


139.7 


56.6 


GGTATACC 


k 


5.5 


54.5 


158.0 


36.0 


6.1 


49.7 


140.3 


40.2 


GTACGTAC 




7.1 


53.8 


150.9 


45.1 


6.8 


57.1 


161.8 


43.9 


GTAGCTAC 










44.9 


6.4 


53.1 


150.9 


40.7 


GTTGCAAC 




7.5 


53.8 


't« 2 - 


47.7 


7.7 


59.9 


167.5 


49.2 


CAAAAAAAG/GTTTTTTTC 


i 


7.2 


68.0 


196 D 


44.2 


6.8 


63.9 


182.9 


44.4 


CAAACAAAG/GTTTGTTTC 


i 


7.7 


64.5 




47.3 


7.6 


63.1 


178.0 


48.3 


CAAATAAAG/GTTTATTTC 




6.5 


58.6 


168 0 


41.4 


6.1 


59.9 


173.0 


40.0 


CAAAGAAAG/GTTTCTTTC 




7.3 


62.8 


179 0 


tlQ 




60.9 


172.1 


46.7 


GCGAATTCGC 


k 


12.9 


so.o 














CCATCGCTACC/GGTAGCGATGG 




13.3 


84.8 


^30 7 


67,6 


12.9 


77.2 


207.0 


69.5 


GCGAAAAGCG/CGCTTTTCGC 


l 


11.9 


74.2 


''Ol 0 




12.6 


81.4 


220.9 


67.2 


CCATTGCTACC/GGTAACGATGG 






82.5 


226.8 












CTGACAAGTGTC/GACTGTTCACAG 




12.6 


81.5 


222.! 


65.7 


12.9 


84.0 


229.2 


66.2 


CATATGGCCATATG 




12.7 


84.5 


231.5 


63.3 


13.2 


92.7 


256.3 


66.4 




Molecules will) Terminal A-T Base 


: Pairs'" 












TCATGA 


(I 


3.3 


50.4 


152.0 


22.8 


3.4 


35.9 


105.3 


17.3 


TGATCA 


(i 


2.8 


52.6 


160.6 












AAAAAAAA/TTTTTTTT 




4.5 


59.8 


178.2 


31.2 


3.9 


58.4 


171.1 


35.2 


TAGATCTA 


i! 


5.1 


49.2 


142.2 


33.4 


4.2 


45.9 


135.9 


24.5 


TCTATAGA 


d 


4.3 


45.7 


133.4 








135.9 


24.5 


ATGAGCTCAT 




10.0 


68.0 


1S7.0 


58.1 


9.5 


66.5 


184.7 


54.4 


TTTTATAATAAA/AAAATATTATTT 


P 


5.5 


75.6 


226.0 


36.3 


a' S 


81.5 


240.6 


41.6 


CAACTTGATATTATTA/GTTGAACTATAATAAT q 


12.4 


102.0 


2S9.0 


3S.8 


12.7 


10B.8 


310.2 


58.1 




Molecules with Non-Two- 


Stale Thermodynamic 












CCCGGG 




6.9 


62.0 


177.8 




7.0 


36.9 


95.2 


52.0 


CCCAGGG/GGGTCCC 




7.9 


63.0 


177,8 


48.1 


7.8 


40.3 


103.7 


57.2 


CGCGAATTCGCG 




20.6 


135.0 


369.0 


75.4 


16.4 


101.3 


272.7 


75.0 


CGCATGGGTACGC/GCGTACCCATGCG 




14.4 


101.0 


279.1 


66.5 


17.4 


100.6 


266.8 


79.7 


GTAT A CCGGTATAC 




12.3 


ss.o 


244,1 


62.2 


13.0 


96.1 


267.6 


63.0 


C ATATTGGCC A ATATG 




13.2 


88.8 


243.6 


65.9 


15.3 


109.5 


303.5 


67.1 


GTAT A ACCGGTTATAC 




14.3 


99.7 


275.4 


66.3 


15.0 


112.9 


314.8 


65.8 


CGCGTA CGCGTA CGCG 




29.1 


158.0 


415.6 


91.0 


24.1 


140.9 


374.5 


85.6 



° Listed by oligomer length and in alphabetical order. For self-complementary sequences only the top strand is given. For non-self-complementary 
sequences bolh strands are given in antiparallel orientation separated by a slash. * Sequences without a literature reference are from Table 1 of this 
work. * Calculated ibr M oligomer concentration for self-compiementaiy sequences and 4 x 10~ 4 M for non-self-complementary sequences. 
''Sugimoto et al. (1994). "Senior ct al. (1988). 'Breslauer (1986). 'Williams el al. (1989). ''Li and Agrawal (1995). 1 Aboul-ela et al. (1985). 
J Mordcn et al. (1983). 1 Breslauer et al. (1986). ' LeBlanc and Morden ( 1991 ). "' For each 5'-tcnninal T-A base pair, +0.4 kcal/moi is added to both 
All" and ACV "Sugimoto et al. (1991). " Li et al. (1991). Bolewska et al. (1984). Tibanyenda et al. (1984). r Arnold etal. (1987). ' Marky et 
al. (1983). 'Plum et al. (1992), " Raap el al. (1985). 
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Table 3: Thermodynamic Parameters for DNA Helix Initiation and 
Propagation in 1 M NaCi" 



propagation 
sequence 


A// 0 
(kcal/mol) 


A5° 
(eu) 




(kcal/mol) 




-rt 4 ns 




i 


-1.02 ± 0.04 


ATYTA 








—0.73 ± 0,05 






* 




—0.60 ± 0.05 


CAirr 


7 A + 1 


~W n + 




— 1.38 ± 0.06 


GT/CA 


-8 6 + 0 7 




5'n 




CT/OA 








— 1.16 ± 0.07 


GA/CT 


-77 ± 07 


-20.3 ± 


1.9 


-1.46 ±0.05 


CG/GC 


-10.1 ±0.9 


-25.5 ± 


2.3 


-2.09 ± 0.07 


GC/CG 


-11.1 ± 1.0 


-28.4 ± 


2.6 


-2.28 ± 0.08 


GG/CC 


-6.7 ± O.fi 


-15.6 ± 


1.5 


-1.77 ±0.06 


initiation atG-C 


(0) 


(-5.9 ± 


0.8) 


+ 1.82 ±0.24 


initiation atA-T 


(0) 


(-9.0 ± 


3.2) 


(+2.8 ±1) 


symmetry correction'' 
5'-terminal T-A bp 1 ' 


0 


-1.4 




+0.4 


+0.4 


0 




+0.4 



" Errors are standard deviations. Extra significant figures are given 
for AH" and AS° to allow accurate calculation of the 7 M . Values in 
parentheses involve assumptions about the initiation process (see text). 
* Initiation parameter for duplexes that contain at least one G-C base 
pair. r Initiation parameter for duplexes thai contain only A-T base pairs. 
'' Symmetry correction applies only to self-complementary sequences. 
''To account for end effects, duplexes are given the penalty listed for 
each terminal 5'-T-A-3' base pair. Note this penalty is not applied to 
sequences with terminal 5'-A-T-3' base pairs (see text). 



describe DNA thermal denaturation. 

Applicability to Non-Two-State Transitions. We believe 
the parameters in Table 3 apply to duplexes from 4 to 20 
base pairs. Beyond 20 base pairs, DNA transitions are 
unlikely to be two-state. Transitions that are not two-state 
require a statistical mechanical model for accurate predictions 
(Gralla & Crothers, 1973; Steger, 1994). Table 2 lists eight 
oligonucleotides that are not two-state. When the AG° 37 , 
AH°, and 7m of these oligomers are predicted with the 
parameters in Table 3, the average deviations of measured 
versus predicted values are 14%, 21%, and 4.7 D C, respec- 
tively. This suggests that the two-state model can also 
provide reasonable approximations for oligomers that do not 
have strictly two-state transitions. 

DISCUSSION 

Application of the Nearest-Neighbor Parameters. The 
nearest-neighbor model asserts that the free-energy for duplex 
formation is the sum of three terms: (1) an unfavorable 
entropy associated with the loss of translational freedom upon 
formation of the first hydrogen bonded base pair (i.e., the 
initiation free energy), (2) the sum of terms for the pairvvise 
interactions between base pairs, and (3) an entropic penalty 
(Bailey & Monahan, 1978; Cantor & Schimmel, 1980) for 
the maintenance of the C2 symmetry of sell-complementary 
duplexes (eq 4). Scheme 1 illustrates the calculation of 
AG° 37 for the sequence GCTAGC using the parameters in 
Table 3. Similarly, the predicted enthalpy change for 
GCTAGC is: Atf°(predicted) = 2(-ll.l) + 2(-6.1) + 
(-6.3) = -40.7 kcal/mol. The measured value is -39.2 
kcal/mol. Note that AW for initiation and symmetry are 
zero. The predicted entropy change for GCTAGC is 
AS°(predicted) = 2(-28.4) + 2(-16.1) + (-18.5) - 5.9 - 
1.4 = -1 14.8 eu. The measured value is -109.1 eu. The 
7" M is predicted at a given oligonucleotide concentration (Or/4 
for non-self-complementary sequences) using eq 2 along with 
the predicted AH° and AS°. For example, the predicted T M 
for GCTAGC at 0.1 mM is 7m (predicted) = (-40 700 call 



Scheme I . Prediction of AG°3 7 



i I I 




AG°„|predicted] = 2 AG°j, (GC/CGt + 2 4G 0 „|CT/GA) + 4G° 3 , (TO/ATI 
+ dG-jTlinit) + AG°j,(nyia| 
= 2 (-2,38) * 21-1.161 + (-0.60) + 1.B2 + D.4 
AG'S, (predicted) = -5.26 kcal/mol. 
AG°„ (observed} = -5.3 kcal/mol. 

mol)/(-114.S eu + 1.9S7 eu x In (1 x 10~ 4 )) = 305.8 K = 
32.6 °C. The measured value is 34.3 °C. Note that the units 
for AH° are kcal/mol and must be multiplied by 1000 to be 
consistent with AS° and the gas constant, R, which are in eu 
[caI/(K mol)]. 

Trends in the Nearest-Neighbor Parameters. The ob- 
served trend in nearest-neighbor stabilities at 37 °C is GC 
> CG > GG > GA sa GT « CA > CT > AA > AT > TA 
(where only the top strand is shown for each nearest 
neighbor). This trend suggests that both sequence and base 
composition are important determinants of DNA duplex 
stability. It has long been recognized that DNA stability 
depends of the percent G-C content (Marmur & Doty, 1962). 
The AG°37 parameters in Table 3 show that diere are 
significant sequence dependent contributions super- 
imposed on the general trend. On the other hand, the nearest- 
neighbor AH° parameters (Table 3) do not follow this trend. 
This suggests that stacking, hydrogen bonding, and other 
contributions to the AH° have a complicated sequence 
dependence. Perhaps, this is not surprising since it is well 
known that the detailed structure of DNA is profoundly 
dependent on sequence (Caliidine & Drew, 1984; Hunter, 
1993). 

The average of the A5°'s for the 10 nearest neighbor 
propagations is -20.9 eu. The agrees reasonably well with 
the sequence independent value of —24.85 ± 1.74 eu/base 
pair derived from polymers dissolved in 0.075 M Na + 
(Delcourt & Blake, 1991). Our results are also consistent 
with a simplistic calculation oF the conformational entropy 
(Cantor & Schimmel, 1980): 

A5 D C01ir = 2R In(3 x7x2x3x3x2) = 

—26.3 eu/base pair 

where the numbers inside the parentheses are the assumed 
number of possible conformations for the a, 0y (together), 
6, e, £, and x dihedral angles. The "2" in front of the gas 
constant is required because two residues must be constrained 
to form a base pair and propagate a helix. This calculation 
systematically overestimates AS 0 C0 „ r because many of the 
possible conformations would have high energies associated 
with them (due to steric repulsion). A more rigorous 
calculation would weight each of the possible conformations 
with a Boltzmann factor. This calculation also neglects salt 
effects and hydrophobic contributions to stacking (Hunter, 
1993). 

Comparison with Previous DNA Nearest-Neighbor Pa- 
rameters. Breslauer et al. (1986) derived nearest-neighbor 
parameters from a data set of 19 oligonucleotides (dissolved 
in 1 M NaCI) and nine polymers (dissolved in low salt with 
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results extrapolated to high salt) and assumed a value of 5.2 
kcal/mol for helix initiation (Borer et al., 1974), Our results 
share some similarities but aiso differ significantly from those 
of Breslauer and co-workers. Except for the GG/CC nearest 
neighbor, the AH° values of Breslauer et al. are within 2 
kca!/mol of those in Table 3 (for GG/CC Breslauer reports 
AW = -3 1 .0 kcal/mol, whereas we observe -6.7 kcal/mol). 
The trend for the stabilities of nearest neighbors with only 
AT base pairs is also similar with AA/TT > AT/TA > TAJ 
AT. On the other hand, the parameters of Breslauer et al. 
(1986) have the stability order CG/GC > GC/CG ^ GG/ 
CC, while GC/CG > CG/GC > GG/CC in our parameter 
set. Breslauer et al. also have CA/GT > AA/TT > GA/CT 
w CT/GA > GT/CA, while GA/CT « GT/CA ^ CA/GT > 
CT/GA > AA/TT in our parameter set. On average, 
Breslauer's parameters predict the AG° 37 , AH°, AS 0 , and 
7m of the two-state molecules given in Table 2 with average 
deviations of 16%, 12%, 13%, and 6.0 °C, respectively. As 
discussed above, the parameters in Table 3 predict die AG° 37 , 
AH a , AS", and T M of the two-state molecules given in Table 
2 with average deviations of 4%, 7%, 8%, and 1.8 °C, 
respectively. This comparison is somewhat biased since our 
parameters were optimized to predict our database, while 
Breslauer's parameters were derived from an independent 
data set. Quartin and Wetmur (1989) use essentially the 
same data as Breslauer et al. (1986) but assume a value of 
+2.2 kcal/mol for helix initiation (Pohl, 1974). These 
parameters predict the AG°i7 and the T M of the two-state 
molecules in Table 2 with average deviations of 10% and 
4.5 °C, respectively. 

Comparison with RNA Nearest-Neighbor Parameters. 
Our DNA parameters also differ significantly from RNA 
parameters measured by Freier et al. (1986a). This is not 
surprising, however, because RNA and DNA helices are 
known to have different structures (i.e., A-form vs B-form). 
Some DNA nearest neighbors are more stable while others 
are less stable than the analogous RNA nearest neighbors. 
For example, the DNA nearest neighbors AA/TT and CG/ 
GC are slightly more stable than the corresponding RNA 
nearest neighbors AA/UU and CG/GC. In all the other cases, 
DNA nearest neighbors are less stable than RNA, We also 
observe that DNA nearest neighbors with only C-G base pairs 
are less sequence dependent (largest difference = 0.51 kcal/ 
mol) than the corresponding RNA nearest neighbors (largest 
difference 1.4 kcal/mol). The relative order for stabilities 
of nearest neighbors with only A-T (DNA) or A-U (RNA) 
base pairs are also different with AA/TT > AT/TA > TA/ 
AT (DNA) vs UA/AU > AU/UA = AA/UU (RNA). 
Apparently, the relative stability of DNA and RNA duplexes 
depends on base sequence. 

The helix initiation parameter is more favorable in DNA 
(+1.82 kcal/mol) than in RNA (+3.4 kcal/mol). This is 
somewhat puzzling as DNA and RNA duplex initiation are 
expected to be similar since both require two strands to 
associate and reduce their translational and rotational degrees 
of freedom by forming a hydrogen-bonded base pair. Further 
work is required to elucidate the origin of this effect. 

Salt Dependence. The parameters in Table 3 apply to 
oligomers dissolved in 1 M NaCI at pl-l 7. To allow for 
approximate predictions in lower salt environments, we 
suggest the following preliminary equation: 



r M ^' = r M 1MNil +12.5 log [Nal (7) 

where 7" M 1M Na is T M predicted from Table 3 (1 M NaCI), 
and 7" M l Na l is the 7" M predicted at the desired sodium 
concentration. This correction for the 7m is in agreement 
with that determined previously (Erie et al., 1987; Rentz- 
eperis et al., 1993) for oligonucleotides but is somewhat 
smaller than that observed in polymers (Marmur & Doty, 
1962; Schildkraut & Lifson, 1965). Between 0.1 and 1 M 
NaCI, this correction predicts the 7*m of 26 sequences 
(dissolved in 0.1-0.3 M NaCI) from the literature (Aboul- 
ela et al., I9S5; Braunlin & Bloomfield, 1991; Gaffney & 
Jones, 1989; Kawase et al„ 19S6; Williams et al„ 1989; 
Lesnik & Freier, 1995) with an average deviation 3.5 °C 
(AHawi and SantaLucia, unpublished results). Below 0.1 
M, this correction is not reliable. This correction assumes 
that trends in nearest-neighbor stability are independent of 
salt concentration. Counterion-condensation theory suggests 
this assumption is reasonable since the salt behavior depends 
on the spacing between phosphates which should be rela- 
tively independent of sequence (Manning, 1978). However, 
this theory applies strictly to polymers and salt concentration 
below 0.1 M, and, for short oligonucleotides, the salt 
behavior may depend on oligonucleotide length (Record & 
Lohman, 1978; Olmsted et al., 1989). Two experimental 
studies, however, suggest that the salt behavior of oligomers 
is remarkably similar to that of polymers (Williams et al., 
1989; Brauniin & Bloomfield, 1991). While the above 
corrections were derived for sodium counterions, potassium 
counterions probably follow the same trend. The behavior 
of oligonucleotide thermodynamics in the presence of 
divalent cations, however, is likely to be more complicated. 
Previous work indicates that 1 M NaCI mimics 0.15 M NaCI/ 
10 mM MgCl 2 (Williams et al., 1989)-a condition similar 
to those commonly used in PCR reactions. Clearly, further 
work on the salt dependence of oligonucleotide thermal 
denaturation is required (Kumar, 1995). 
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ABSTRACT 

Nucleic acid double helices are stabilized by hydro- 
gen bonding and stacking forces (a combination of 
hydrophobic, dispersive and electrostatic forces) of 
the base pairs in the helix. One would predict the 
hydrogen bonding contributions to increase and 
the stacking contributions to decrease as the 
water activity in the medium decreases. Study of 
nucleobase paired duplexes in the absence of water 
and ultimately in pure aprotic, non-polar organic 
solvents is not possible with natural phosphodiester 
nucleic acids due to the ionic phosphate groups and 
the associated cations, but could be possible with 
non-ionic nucleic acid analogues or mimics such as 
peptide nucleic acids. We now report that peptide 
nucleic acid (PNA) (in contrast to DNA) duplexes 
show almost unaffected stability in up to 70% 
dimethylformamide (DMF) or dioxane, and extra- 
polation of the data to conditions of 100% organic 
solvents indicates only minor {or no) destabilization 
of the PNA duplexes. Our data indicate that stacking 
forces contribute little if at all to the duplex 
stability under these conditions. The differences in 
behaviour between the PNA and the DNA duplexes 
are attributed to the differences in hydration and 
counter ion release rather than to the differences in 
nucleobase interaction. These results support the 
possibility of having stable nucleobase paired 
double helices in organic solvents. 



INTRODUCTION 

Nucleic acid double heiices are stabilized by hydrogen 
bonding and stacking forces (a combination of hydro- 
phobic, dispersive and electrostatic forces) of the base 
pairs in the helix (1-5). The most recent data suggest that 
stacking interactions are the more important for DNA 
duplex stability (6,7). Indeed these results suggest that 
while hydrogen bonding in GC base pairs are stabilizing 



the helix, hydrogen bonding in AT base pairs are slightly 
destabilizing (6,7). One would predict the hydrogen 
bonding contributions to increase and the stacking 
contributions to decrease as the water activity in the 
medium decreases. Therefore, it could be or great interest 
both from a structural and thermodynamic as well as 
from a functional point of view to siudy nucleobase 
paired duplexes in the absence of water and ultimately in 
pure aprotic, non-polar organic solvents. 

This is not possible with natural phosphodiester nucleic 
acids due to the ionic phosphate groups and the 
associated cations, but could be possible with uncharged 
nucleic acid analogues or mimics such as methyiphos- 
phonates (S), phosphotrieslcrs (9), morpholino derivatives 
(10) and peptide nucleic acids (PNAs) (11,12). PNAs are 
DNA pscuclopeptide mimics capable of forming DNA- 
lifce Watson-Crick base-pair double helices with sequence 
complementary DNA, RNA and PNA oligomers (13-1S). 
As the aminoethyl glycine backbone of PNA is charge 
neutral, no counter ions are required for stabilizing 
PNA-PNA duplexes, and consequently such duplexes 
may more conveniently be studied in organic solvents. 

Results from a previous study has given preliminary 
indications that PNA-PNA duplexes are much less 
all'ected by the presence of organic co-solvent (50% 
DMF) than DNA-DNA duplexes, correlating well with 
the lack of measurable changes in hydration or counter 
ion binding upon PNA duplex formation (19). However, 
this observation warrants further exploration. We now 
report that PNA (in contrast to DNA) duplexes show 
almost, unaffected stability in 70% dimethylformamide 
(DMF) or dioxane, and extrapolation of the data to 
conditions of 100% organic solvents indicates only a little 
(or no) destabilization of the PNA duplexes. 



MATERIALS AND METHODS 
PNAs 

PNA1, PNA2, PNA3, PNA4 and PNA 5 were synthesized 
using solid phase Boc chemistry, purified by HPLC 
and characterized by MALDI-TOF mass spectrometry 
as described previously (20). PNA concentrations 



To whom correspondence should be addressed. Tel: +45 35 327762; Fax: +-15 35 396042; Email: peti@imbg.ku.dk 
© 21)1)7 Thi! Amhor(s) 

This is -in Open Acwss article distributed under die terms of Hie Creative Commons Attribution Kou-CominL-ici.il License (liijp;,',Vrviilivr.->jninions.cre,'liccn«s; 
by-iic/2.U/uk/) which permit* unrestricted imn-comiiierei.il use, distribution, and reproduction in any medium, provided the origiiul work is properly cited. 



336S Nucleic Acids Research, 2007, Vol. 35, No. JO 



were determined spectrophotometrically at 65°C 
using molar extinction coefficients: £260 of adenine = 
15400 M -1 cm"\ £ 2 go of guanine = 1 1 700 M~ ] cm" 1 , 
e->c 0 of thymine = 8800 M -1 cm -1 and £ 2 go of cytosine = 
7400 M"' cm- 1 . 

Chemicals and DNAs 

All chemical reagents used were of analytical grade except 
for dimethylformamide (DMF) and dioxane, which 
were spectroscopic grade from Sigma-Aldrich, Munich, 
Germany. The DNA oligonucleotides were purchased 
from DNA Technology, Aarhus, Denmark, and used 
without further purification. 

Sample preparation 

Main stock solutions of PNAs and DNAs were prepared 
by dissolution in deionized, double distilled water. 
Experimental samples were made by diluting from the 
corresponding main stock solutions in 10 mM phosphate 
buffer (pH 7.2) containing 100 mM NaCl and 0.1 mM 
EDTA. 

Equimolar mixtures (1:1 stoichiometry in single strands) 
of the PNA or DNA and its complementary strand were 
dissolved in the buffer mentioned above with desired 
amount of organic co-solvents. The duplex formation was 
assured by heating to 90°C and then cooling slowly to 
room temperature to allow proper annealing. No sign of 
aggregation or decreased solubility of the PNAs at up to 
70% of organic co-solvents was observed. 

UV-melting experiments 

The thermal melting experiments were performed on 
a Cary 300 Bio UV-visible spectrophotometer (Varian, 
Cary, NC, USA) connected to a temperature controller. 
Thermal melting profiles were obtained using heating- 
cooling cycles in the range of -3 to 95°C. The melting 
temperature (r m ) was determined from the peak of the first 
derivative of the heating curve. Cuvettes of 1.0 cm path 
length and 1 ,0 ml volume were used for all experiments. 

Thermal melting curves at >50% of DMF or dioxane 
start to lose the upper baseline and show severe 
disturbances partly because of high absorbance of DMF 
at the wavelength required for the experiments. Therefore, 
thermodynamic data at >50% of DMF could not be 
obtained. However, it was possible to obtain T m values at 
60 and 70% of DMF or dioxane. 



Then 



The thermodynamic parameters viz enthalpy change 
(All 0 ), entropy change (AS 0 ), and Gibbs' free energy 
change (AC 0 ) were evaluated using the 'hyperchromicity 
method' (curve fitting) and/or the concentration 
method (21). 

The hyperchromicity method 

The hyperchromicity method utilizes alpha curve and 
van't Hoff plots (In K r versus T~ l ) according to the 
following definitions (21): The fraction (a T ) of single 



strands that remained hybridized in the duplex at a 
particular temperature Tin Kelvin is represented as 



et T - 



A s - 



1 



where, A^ is the absorbance of the duplex in fully 
hybridized condition, /l s is the absorbance of the single 
strands in fully denatured condition and A s is absorbance 
at a particular point on the thermal melting curve at 
temperature T. 

For non-self-complementary sequences forming «-mer 
structures, the general equilibrium constant equation at a 
particular temperature T can be expressed as: 

r—^ ?L 



where, c ls represents the total concentration of strands and 
11 is the moiecularily of the complex. Assuming a two-state 
model, Equation (2) reduces to 



The van't Hoff plot In /vt versus T 1 is a straight line 
represented by 



*«-{-— )t + {— ) 



Hence, All 0 can be obtained from the slope and AS 0 
can be obtained from the ^-intercept of the van!t Hoff 
plot, AG 0 at a particular temperature Tin Kelvin can be 
calculated from 



AG 



-RT In Kj = All 



■ TAS 



where R is the universal eas constant which is equal to 
1 .986 cal/mol K. 



The concentration method 

The concentration method utilizes a plot of T~ l versus In 
f, s , where T m is the thermal melting temperature of the 
duplex and c, s is the total strand concentration of PNA or 
DNA. 

Since 7' m is defined by the temperature where a — 0.5 for 
a two state transition, combining Equations (3) and 
(4) yields: 



1 



r 



T m All 1 



In c 



AS" 



R In 4 



All ti 



Thus, the thermodynamic parameters can be 
extracted from a linear fit to a plot of 7^,' versus In c u 
according to Equation (6) (21), where All 0 is obtained 
from the slope of the linear fit and AS 0 from the Y- 
intercept. 

The values of the thermodynamic parameters calculated 
by this method are thus independent of strand concentra- 
tion, which is not the case with the hyperchromicity 
method described above. 
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RESULTS 

PNA and DNA duplexes 

In order to elucidate the properties of PNA duplexes as 
compared to those of iso-scquential DNA duplexes at 
reduced water activity, we have studied the effect of organic 
co-solvents on the thermal stability and thermodynamics 
of PNA-PNA, PNA-DNA and DNA-DNA duplexes of 
'random', mixed base-sequence (Table S 1). We chose DMF 
(dielectric constant of DMF is 36.7) and dioxane (dielectric 
constant of dioxane is 2.2) as organic co-solvents, as these 
are aprotic and are not hydrogen bond donors but still 
sufficiently polar to retain solubility of the PNA-DNA 
complexes even above 50% organic solvent. 

The thermal stability of these duplexes and the 
corresponding thermodynamic parameters [evaluated 
using both the concentration method (7"~' versus lnc la 
plot) (21) and the hyperchromia ty (curve fitting) method 
(21)] in aqueous medium with increasing amount of DMF 
(extrapolated to 100% DMF) are presented in Tables 1 
and S2. Representative thermal melting curves are shown 



in Figures SI and S2. It is important to note that thermal 
denaluralion curves showed essentially unperturbed 
monophasic behaviour up to 50% organic co-solvent. 
The extrapolation was performed on the basis of the linear 
plots of the thermal stability (7* m ) and Gibbs' free energy 
changes (AG U ) as a function of increasing amount of 
DMF in the medium (Figure 1A and B). Because of 
insufficient thermal stability of the DNA1-DNA2 duplex 
in >30% DMF, we also designed a longer DNA-DNA 
duplex (DNA3-DNA4), that has thermal stability in 
aqueous solution similar to that of the PNA1-PNA2 
duplex (data in Tables 1 and S3, Figure 1A and B, 
representative thermal curves in Figure S3). While the 
organic solvent-dependent changes in AG U of all these 
duplexes show a very good correlation with the corre- 
sponding T m , A/7° and AS 0 values were not significantly 
a Heeled within experimental error (Tables S2-S4). The 
extraction of thermodynamic parameters derived 
from thermal melting method requires that no change in 
heat capacity occurs in the duplex single-strand equili- 
brium. For the present systems we have found only 



Table 1. Thermal stability and thermodynamic parameters of PNA and DNA duplexes" 



Duplex DM1- 1 ' (%) T m ( Cf' 1 &C,% (kcal/mol)* 1 -* AAG U ' 



PNA1-PNA2 


0 


70.2 ±0.3 


-16.6 ±0.5 (-17.2) 


5.6 




10 


68.3 ±0.2 


-15.4 ±0.5 (-I6.2J 






20 


66.2 ±0.2 


-14.3±0.4 (-15.6) 






30 


64.3 ±0.5 


-14.5±0.S (-15.7) 






40 


63.4 ±0.3 


-14.4 ±0.5 (— is.yj 






50 


61.3 ±0.4 


-13.6 ±0.7 (-15.5) 






SO 


60.1) ± 0.6" 






70 


56.1 ± 0.5 s 








too 1 


52.6 


-11.0 




DNA1-DNA2 


0 


35.8 ±0.5 


-S.2±0.7 (-7.9) 


15.0 




10 


29.2 ±0.4 


-6.2 ±1.2 (-6.1) 






20 


23.7 ±0.5 


-4.9±0.8 (-4.9) 






30 1 


18.2 ±0.3 


-3.7±0.5 (-3.9) 






100' 




6.8 




DNA3DNA4 


0 


70.0 ± 0.4 


-16.4 ±0.4 


19.2 




10 


62.1 ±0.5 


-14.3 ±0.7 






20 


56.2 ±0.6 


-12.8±0.B 






30 


49.2 ±0.3 


-11. 3± 1.0 






40 


41.3 ±0.2 


-9.3 ±0.6 






50 


30.1 ±0.5 


-6.1 ±0.5 






100' 




2.8 




PNAI-DNA2 


0 


5 1.3 ±0.4 


-10.3 ± 1.0 


6.8 




10 


47.0 ±0.5 


-9.5±0.7 






20 


43.1 ±0.3 


-9.0 ±0.6 






30 


40.2 ±0.4 


-8.3 ±0.6 






40 


36.2 ±0.5 


-7.7 ±0.8 






50 


31.! ±0.2 


-6.7±0.5 






100' 


12.3 


- -3.5 





"PNA and DNA sequences: H-GTA GAT CAC T-Lys-NH, (PNA1); H-AGT GAT CPA C-Lys-NH, (PNA2); 5'-GTA GAT CAC T-3' (DNA 1)- 5'-AGT 
GAT CTA C-3' (DNA2); 5'-AGT GAT CTA COG TGG ACG GTC C-3' (DNA 3); 5 -GGA CCG TCC ACC G'l'A GAT CAC T-3' (DNA4). 
''Vol":, in WmM phosphate buffer containing 100 mM NaCI and 0.1 mM EDTA, pH 7.2 ±0.01. 
'"Duplex concentration* of 5.0 pM in strands were used (T m plots arc in Figures I A. S4A and S5A). 
■"Standard deviations are based on live independent measurements. 

'Evaluated from the hypcrchromicity (curve lilting) method (21) at 37X (AC 0 ,, plots are in Figures IB, S413 and S5I3). Values obtained from the 
concentration method (21) (Equation 6) are shown in paratheses (lull details of these data are in Supplementary Data). 
r AAC" = AG" 37 (DMl-)-AG ! ' 37 (aqueous). Calculated only with the values obtained from the hyperchromieity method. 
These data have poorer accuracy due to upper baseline irregularities. 
''Data could not be evaluated because of bad thermal curves. 

|Data were obtained from manual extrapolation of the linear plots in Figure IA and B. 

J Data obtained at higher than 30% of DMF were not reliable because of too low values of T m . 
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0 10 20 30 40 50 60 70 
DMF or formamide (%) 

Figure 1. Plots of (A) T m and (I!) AC" of PNA1-PNA2 (solid diamond), 
PNA1-DNA2 (solid xipiure), DNA1-DNA2 (open xipiure) and 
DNA3-DNA4 (wlid triable) as a function uf die amount of DMF in 
the medium. (C) Plots of T m of PNA1-PNA2 (open iritmgle) and 
DNA3-DNA4 (open mpnire), as a function or the nmounl of formamide 
in the medium, compared to tiiat of PNA1-PNA2 (dashed line) and 
DNA3-DNA4 (dolled line), as a function of the amount of DMF in the 
medium (DMF data taken from Figure 1A). The aqueous buifer was 
10 mM phosphate buffer containing 100 mM NaCl and 0.1 mM EDTA, 
pH 7.2±0.0i (dala in Tables 1, 52-S5). 



minor changes. The values of specific heat capacity change 
(enthalpic) at constant pressure (AC,,) were I.Okcal/ 
mol.K for PNA1-PNA2, l.4kcal/mol.K for 
PNA1-DNA2, ! .9 kcal/mol.K for PNA2-DNA! and 
l.Skcal/mol.K for DNA1-DNA2 (19). 

It is clearly evident that the presence of DMF has a 
much smaller effect on the thermal stability and free 
energy of the PNA-PNA duplex than on the DNA-DNA 
duplex, whereas the effect on the PNA-DNA duplex is 
intermediate. The plots in Figure 1A and B show a linear 
dependence or both T m and AC 0 on the DMF concentra- 
tion. The ciiange in thermal stabilities as a function of the 



amount of DMF in the medium did not deviate within 
experimental error from linearity up to 70% of DMF, 
thereby supporting a hypothetical approximation beyond 
70% of DMF (and an extrapolation of the data to 100% 
DM! 7 ) (Figure I A and B, Table I). These results 
strongly indicate that PNA-PNA duplexes in contrast to 
DNA-DNA (and PNA-DNA) duplexes will have appre- 
ciable stability even in the absence of water. Naturally, 
we cannot exclude that non-linearity might occur at very 
low water contents. 

Analogous studies of PNAI-PNA2, PNA1-DNA2 and 
DNA3-DNA4 in dioxane show that, despile ihe significant 
difference between the dielectric constants of these 
solvents, there is no significant difference between the 
effects of DMF and dioxane on T m and AG 0 (Tables S3 
and S4, representative thermal curves in Figures S2 and 
S3, plots in Figures S4 and S5). 

PNA suul DNA duplexes in formamule 

In order to compare the effect of aprotic solvent (DMF 
and dioxane, where the destabilization of the duplex is 
assumed to be predominantly caused by dehydration (and 
perhaps change in dielectrics) with that of a hydrogen- 
bond donor (and breaking) solvent, we studied the 
thermal and thermodynamic properties of PNA I -PNA2 
and DNA3-DNA4 in formamide (dielectric constant of 
formamide is 109), which is a well-established nucleic acid 
denaturant (Table S5, representative thermal curves in 
Figure S6, plots in Figures 1C, S7 and SS). In formamide, 
the destabilization is caused by a combined effect of 
H-bond disruption and dehydration. Notably, the desta- 
bilizing effect of formamide is almost as pronounced for 
the PNA duplex as for the DNA duplex. The slopes of the 
linear plots of T m of PNA1-PNA2 and DNA3-DNA4 as a 
function of increasing amount of DMF in the medium are 
-0,17 (without taking the values at 60 and 70% into 
account) and —0.77, respectively, whereas, those values in 
formamide are -0.40 and -0.61, respectively (Figure 1C). 
On the other hand, the decrease in 7!,, is paralleled by the 
increases in both AC U and Alf. Thus in stark contrast 
to the effects of DMF and dioxane, formamide has 
comparable cfi'ects on PNA and DNA duplex stabilities. 

Self-cuninleinentary PNA and DNA hairpins 

Additionally, we examined the effect of organic co-solvent 
on the stability of self-complementary (foldback) 
PNA hairpins (Figure 2). UnfoldinE of hairpin 
PNAs, H-AGAG-(egl) r CTCT-Lys-NH 2 (PNA3) and 
H-ACAG-(egl) r CTGT-Lys-NH 2 (PNA4) also showed 
little change in T m and AG IJ with increasing amount of 
DMF or dioxane in the medium (Tables S6 and S7, 
Figures 2A, 13 and S9). In fact, the hairpins were slightly 
stabilized in dioxane (Table S7 and Figure 2B), and in this 
case the stabilization appears to be cnihalpic (Table S7). 
Similar studies with an analogous DNA hairpin (DNA5: 
5'-AG AGTTTTCTCT-3 ') in DMF and dioxane as a 
control showed a dramatic thermal destabilization 
by both co-solvents (Tables SS and S9, Figures 2C, 
D and S10). 
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0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 SO 90 100 

DMF (%) Dioxane (%) 

Figure 2. Plots or T„ a( SQff-CQmplementury (hairpin) PNAs I'NA3 (w/trf inverted /rumple) and PNA4 {solid circle) us a Amnion or the amount of 
(A) DMF lind (11) dioxane in the medium. Plots or T m of hairpin DNA com ml DNA5 as a function or the amount or (C) DM F (open triangle) 
and (D) dioxane {open siptniv) in lite medium. The aqueous buffer was iOmM phosphate buffer containing 100 mM NaCi and 0.1 mM EDTA, 
pl-l 7.2 ±0.0 1 (PNA3: H-AGAO-(cEl) 3 -CTCT-Lys-N[-l2, PNA4: H-ACAG-(egl),-CTGT-Lvs-NI-I->, DNA5: 5'-AGA GTf TTC TCT-3') (data in 
Tables S6-S9). 



PNA duplex with an endstacked tricyclic thymine 

In order to evaluate the effccL of removal of water from 
the medium (by increasing DMF content) on base 
stacking forces more directly, we studied the effect of 
DMF on the behaviour of a PNA duplex (PNA5-PNA6) 
containing a tricyclic thymine analogue (benzof/?]- 
l,S-naphthyridin-2(l//)-one, designated as tT) (22) att- 
ached at the end of one strand, thereby stabilizing the 
duplex predominantly (or entirely) by end-slacking (22). 
It is evident that, DMF preferentially destabilizes 
the PNA5-PNA6 duplex (T m slope is -0.24) compared 
to the control PNA6-PNA7 duplex (T m slope is -0.15) 
(Table S10, Figures 3 and Sll). Most interestingly, the 
stabilization of the duplex contributed by the tT base is 
completely lost at 70% DMF (Figure 3), clearly indicating 
a dramatic reduction of the contribution of slacking 
interactions lo duplex stability under these conditions 
(decrease in thermal stability of PNA5-PN A6 at 70% 
DMF with respect to 0% DMF, Ar m =!8.S, whereas, 
that of the control duplex PNA6-PNA7 is only 10.7). 

Mismatched duplexes 

Sequence discrimination is critically dependent on hydro- 
gen bonding recognition, but base-pair mismatches may 
also change the geometry of the helix and thus influence 
slacking interactions. For instance, X-ray crystallography 
and NMR studies have revealed that G-T and 



90 



BO 




50 



40-1 1 1 1 1 1 , , 

0 10 20 30 40 50 60 70 80 
DMF (%) 

Figure 3. Plots of T m of PNA5-PNAG (open inverted triangle) 
containing tricyclic thymine {tT) and its control PNA6-PNA7 
{open circle), as a junction of the amount of DMF in the medium 
(data in Table SID). The aqueous buffer was IOmM phosphate buffer 
containing lOOmM NaC! and O.ImM EDTA, pl-l 7.2 ±0.01 {PNA5: 
li-tT-GTA GAT CAC T-NH,, PNA6: ll-AGT GAT CTA C-NH-,, 
PNA7: H-GTA GAT CAC T-NH-.). 



TT mismatches in DNA likely form wobble base pairs 
(23-30). The homopurine A-A mismatch in DNA involves 
one hydrogen bond between the amino group of one 
adenine and the nitrogen NI of the opposite adenine, 
whereas, the mispaircd TT bases in DNA involve two 
imino-carbonyl hydrogen bonds (23-30). A CT mismatch 
is the most unstable in DNA and makes one regular 
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H-bond involving carbonyl groups along with one weak 
H-bond bridged by a water molecule at neutral pH 
(23-30). Effects of mispairs are rather localized and the 
duplex retains a global B-form conformation (23-30). 
As hydration and stacking forces could be more important 
factors for mismatch stabilization, removal of water may 
preferentially destabilize these. 

However, surprisingly we find that the relative effects of 
DMFon the stability of single base mismatched (T-T, A-A 
and C-T) duplexes of PNA8 (H-AGT GTT CTA C-Lys- 
NHt), PNA9 (H-GTA GAA CAC T-Lys-NHi) and 
PNA 10 (H-GTA GCT CAC T-Lys-NH-,} with PNA1 
and PNA2 are not distinguishable within experimental 
error from that observed with the corresponding fully 
matched PNA duplexes (The T m slope of the fully 
matched PNAI-PNA2 is -0.17, of T-T mismatched 
PNAI-PNA8 is -0.13. of A A mismatched FNA2-PNA9 
is -0.08, and of C-T mismatched PNA2-PNAI0 is -0.12) 
(Tables 1 and SI 1, Figures I A, 4, SI2 and S13). These 
results suggest that, in these base pair mismatched 
duplexes, hydrogen bonding and slacking interactions 
are equally compromised or that the mismatched base 
pair contributes only very little to the overall duplex 
stability. 



A too 




DISCUSSIONS AND CONCLUSIONS 

The relative inscnsilivity of PNA duplexes to the reduced 
water content is consistent with our previous observations 
showing that in contrast to the case of DNA-DNA (and to 
a lesser extent PNA-DNA duplexes), no change in 
hydration occurs upon duplex formation (19) and no 
change in counter ion binding is observed either (19). 

Thus tiie effect of organic solvents on PNA duplex 
stability may be ascribed to changes in base-pair hydrogen 
bonding and stacking forces of the base pairs. Stacking 
forces have contributions from hydrophobic, dispersion 
and dipoie electrostatic forces (31-33), and therefore both 
hydrogen bonding as well as stacking interactions arc 
affected by the dielectric constant of the solvent. However, 
we observe no correlation between PNA duplex stability 
and solvent dielectric constant. Water, formamide, DMF 
and dioxane have dielectric constants of 78, 109, 37 and 
2.2, respectively, and in particular the comparable effect of 
dioxane and DMF strongly argue against a simple 
dependence of the dielectric constant. On the other 
hand, it is clear that the contribution of hydrogen bonding 
to PNA duplex stability should significantly increase when 
the dielectric constant and especially the hydrogen 
bonding donor/acceptor activity of the solvent decreases. 
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smutched PNA duplexes in DMF (data in Table SI I). Plots of (A) T m (solid dhmwml) and 
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(It should be kept in mind that the hydrogen bond 
stabilization results as the difference between nucleobase 
hydrogen bonding to solvent versus base pairing.) 
Furthermore, our results very clearly show that the 
stabilizing effect of end stacking decreases upon addition 
of DM1- to the solvent. Indeed, the end-stacking effect 
using a tricyclic thymine analogue is absent already at 
70% DMF. This thymine analogue in essence contains an 
additional phenyl ring compared to thymine (22), and 
therefore the stacking interactions of the tricyclic thymine 
are expected to contain a relatively larger hydrophobic 
contribution than natural nucleobase pairs (32). Thus a 
similar effect on the natural nuclcobases is expected to 
occur at higher DMF (lower water) content. Nonetheless, 
our results strongly support the contention that relative 
helix stabilizing contribution of base-pair hydrogen 
bonding and base stacking will be shifting towards 
hydrogen bonding as the water content of the medium 
(as well as the dielectric constant) decreases. In this 
context, we note that for the PNA hairpins an increased 
stability is indicated with increasing concentration of 
the low dielectric constant solvent dioxane (in contrast 
to DMF). 

It is well established that formamide and urea destabi- 
lize DNA and RNA duplexes (34,35), and in agreement 
with the present data, the effect of DMF and dioxane 
(at I0mol%) on RNA duplex stability was found to be 
comparable to that of formamide (33). However, also in 
case or RNA, no direct correlation between the duplex 
stabilizing effect and the solvent dielectric constant was 
apparent (33). Indeed, from our data comparing the 
effects of formamide and DMF on PNA duplex stability, 
we would argue that the effects of organic solvents on 
DNA and RNA duplexes is to a larger extent due to 
hydration (solvation) and counterfoil effects, whereas the 
effects on PNA duplexes more directly reflect changes in 
hydrogen bonding and stacking contributions to helix 
stability. Thus more systematic studies on the different 
behaviour of PNA versus DNA duplexes could possibly 
shed light on these relative contributions to nucleobase 
(including DNA) helix stability. 

Interestingly, the data (Tables S2 and S3) could hint 
that the decreased stability of the DNA duplexes upon 
removal of water appears to be primarily cntropic rather 
than enlhalpic, though the data are not conclusive. This 
would be compatible with the differences in behaviour 
between the PNA and the DNA duplexes (as mentioned 
above) being caused by the differences in hydration and 
counter ion release rather than via nucleobase interaction. 
Thus, these results can be of great significance in under- 
standing nucleobase pairing systems, and it will be of great 
interest to learn whether other non-ionic DNA analogues 
such as methylphosphonates (S), phosphotriester (9) and 
morpholino (10) derivatives behave similarly. 

The possibility of having stable nucleobase paired 
double helices in organic solvents has several implications. 
First of all, we note that it should be possible to reach the 
goal of finding PNA duplexes that are freely soluble in 
non-polar organic solvents by chemically modifying the 
PNA backbone, for instance using hydrophobic amino 
acids in place of glycine (36). Studying the behaviour of 



these in the absence of water could provide valuable 
information about fundamental properties of nucleobase 
double helices. Furthermore, PNA duplexes spanning 
lipid membranes could be envisaged as a means for 
mediating electron transport over the membrane. 
Sequence-dependent electron transport (or hole migra- 
tion) through DNA duplexes is now well established 
(37-41), and although electron transport involving PNA 
still require more extensive studies (42,43) this should be 
worthwhile pursuing. Finally, entirely new dimensions 
would be added to the emerging technique of 'DNA 
sequence' directed organic synthesis (44). Thus novel 
avenues of exploiting nucleobase recognition systems can 
be enabled by transferring these to non-aqueous 
environments. 
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